51 research outputs found
Machine Learning in Automated Text Categorization
The automated categorization (or classification) of texts into predefined
categories has witnessed a booming interest in the last ten years, due to the
increased availability of documents in digital form and the ensuing need to
organize them. In the research community the dominant approach to this problem
is based on machine learning techniques: a general inductive process
automatically builds a classifier by learning, from a set of preclassified
documents, the characteristics of the categories. The advantages of this
approach over the knowledge engineering approach (consisting in the manual
definition of a classifier by domain experts) are a very good effectiveness,
considerable savings in terms of expert manpower, and straightforward
portability to different domains. This survey discusses the main approaches to
text categorization that fall within the machine learning paradigm. We will
discuss in detail issues pertaining to three different problems, namely
document representation, classifier construction, and classifier evaluation.Comment: Accepted for publication on ACM Computing Survey
Conserved molecular interactions in centriole-to-centrosome conversion.
Centrioles are required to assemble centrosomes for cell division and cilia for motility and signalling. New centrioles assemble perpendicularly to pre-existing ones in G1-S and elongate throughout S and G2. Fully elongated daughter centrioles are converted into centrosomes during mitosis to be able to duplicate and organize pericentriolar material in the next cell cycle. Here we show that centriole-to-centrosome conversion requires sequential loading of Cep135, Ana1 (Cep295) and Asterless (Cep152) onto daughter centrioles during mitotic progression in both Drosophila melanogaster and human. This generates a molecular network spanning from the inner- to outermost parts of the centriole. Ana1 forms a molecular strut within the network, and its essential role can be substituted by an engineered fragment providing an alternative linkage between Asterless and Cep135. This conserved architectural framework is essential for loading Asterless or Cep152, the partner of the master regulator of centriole duplication, Plk4. Our study thus uncovers the molecular basis for centriole-to-centrosome conversion that renders daughter centrioles competent for motherhood.J.F., Z.L., S.S. and N.S.D. are supported from Programme Grant to D.M.G. from Cancer Research UK. H.R. is supported from MRC Programme Grant to D.M.G. J.F. thank the British Academy and the Royal Society for Newton International Fellowship and Z.L. thanks the Federation of European Biochemical Societies for the Long-Term postdoctoral Fellowship. The authors thank Nicola Lawrence and Alex Sossick for assistance with 3D-SIM.This is the author accepted manuscript. The final version is available from NPG via http://dx.doi.org/10.1038/ncb327
Ciliopathies: an expanding disease spectrum
Ciliopathies comprise a group of disorders associated with genetic mutations encoding defective proteins, which result in either abnormal formation or function of cilia. As cilia are a component of almost all vertebrate cells, cilia dysfunction can manifest as a constellation of features that include characteristically, retinal degeneration, renal disease and cerebral anomalies. Additional manifestations include congenital fibrocystic diseases of the liver, diabetes, obesity and skeletal dysplasias. Ciliopathic features have been associated with mutations in over 40 genes to date. However, with over 1,000 polypeptides currently identified within the ciliary proteome, several other disorders associated with this constellation of clinical features will likely be ascribed to mutations in other ciliary genes. The mechanisms underlying many of the disease phenotypes associated with ciliary dysfunction have yet to be fully elucidated. Several elegant studies have crucially demonstrated the dynamic ciliary localisation of components of the Hedgehog and Wnt signalling pathways during signal transduction. Given the critical role of the cilium in transducing “outside-in” signals, it is not surprising therefore, that the disease phenotypes consequent to ciliary dysfunction are a manifestation of aberrant signal transduction. Further investigation is now needed to explore the developmental and physiological roles of aberrant signal transduction in the manifestation of ciliopathy phenotypes. Utilisation of conditional and inducible murine models to delete or overexpress individual ciliary genes in a spatiotemporal and organ/cell-specific manner should help clarify some of the functional roles of ciliary proteins in the manifestation of phenotypic features
Measurement of moisture-related strain in bonded ash depending on adhesive type and glueline thickness
Structural wood-adhesive bonds (WAB) have to be durable while subjected to considerable stresses caused by mechanical loads and moisture content changes. To better understand the moisture-related durability of WABs, knowledge is important of how moisture changes generate strain in the bond. In this paper, strain on end-grain surfaces of bonded ash specimens was analyzed by means of digital image correlation. Strains were generated by wood shrinkage, and the evaluation was focused on shear strain (SStr). The bond lines were studied depending on the adhesive type – phenol resorcinol formaldehyde (PRF), melamine urea formaldehyde (MUF), polyurethane (PUR), and emulsion polymer isocyanates (EPI). Moreover, three different glueline (GL) thicknesses of MUF were taken into consideration. Comparing the adhesive types, SStr distributions (SStrD) were strongly influenced by adhesive elasticity. MUF and PRF bonds were quite rigid and were associated with pronounced strain amplitudes in and close to the GL together with strain dissipation reaching deep in the wood. PUR and EPI adhesives were more elastic and therefore allowed for smoother strain transition showing less distinct strain peaks. GL thickness had significant impact on SStrD. A high strain level and direct strain transition between adherends was found for the 0.01 mm GL, whereas a pronounced strain decrease was observed in the 0.1 and 0.2 mm GLs. This indicates different stress levels in the wood-adhesive interface dependent on GL thickness.ISSN:0018-3830ISSN:1437-434
Entwicklung und Anwendung des automatischen Indexierungssystems AIR/PHYS
SIGLECopy held by FIZ Karlsruhe; available from UB/TIB Hannover / FIZ - Fachinformationszzentrum Karlsruhe / TIB - Technische InformationsbibliothekDEGerman
AIR/X - a Rule-Based Multistage Indexing System for Large Subject Fields
AIR/X is a rule-based system for indexing with terms (descriptors) from a prescribed vocabulary. For this task, an indexing dictionary with rules for mapping terms from the text onto descriptors is required, which can be derived automatically from a set of manually indexed documents. Based on the Darmstadt Indexing Approach, the indexing task is divided into a description step and a decision step. First, terms (single words or phrases) are identified in the document text. With term-descriptor rules from the dictionary, descriptor indications are formed. The set of all indications from a document leading to the same descriptor is called a relevance description. A probabilistic classification procedure computes indexing weights for each relevance description. Since the whole system is rule-based, it can be adapted to different subject fields by appropriate modifications of the rule bases. A major application of AIR/X is the AIR/PHYS system developed for a large physics database. This application is described in more detail along with experimental results
- …